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Abstract —We study the impact of delayed channel state 
information at the transmitters (CSIT) in two-unicast wireless 
networks with a layered topology and arbitrary connectivity. We 
introduce a technique to obtain outer bounds to the degrees-of- 
freedom (DoF) region through the new graph-theoretic notion 
of bottleneck nodes . Such nodes act as informational bottlenecks 
only under the assumption of delayed CSIT, and imply asym¬ 
metric DoF bounds of the form mD\ + D 2 < m. Combining 
this outer-bound technique with new achievability schemes, we 
characterize the sum DoF of a class of two-unicast wireless 
networks, which shows that, unlike in the case of instantaneous 
CSIT, the DoF of two-unicast networks with delayed CSIT can 
take an infinite set of values. 

I. Introduction 

Characterizing network capacity is one of the central 
problems in network information theory. While a general 
solution is still far in the horizon, considerable research 
progress has been attained in several different fronts. In 
particular, single-flow networks are well understood and 
known to obey max-flow min-cute type principles, both for 
the case of wireline networks [1], [2] and for the case of 
wireless networks [3]. 

When we consider multi-flow networks, however, the 
picture is much less clear. As a natural first case to con¬ 
sider, networks with two source-destination pairs, or two- 
unicast networks, have recently been the focus of significant 
attention [4]—[9], but complete capacity characterizations are 
still a distant goal. In fact, characterizing the capacity of 
two-unicast wireline networks is known to be as hard as 
the general /c-unicast wireline problem [10]. In the wireless 
setting, matters become even more challenging since signals 
transmitted at different nodes interfere with each other, 
causing the two information flows to mix. 

In an attempt to obtain first-order capacity approximations 
and capture the impact of interference in multi-flow wire¬ 
less networks, a number of recent works have focused on 
characterizing the degrees of freedom of different network 
configurations. Roughly speaking, the degrees of freedom 
(DoF) of a wireless network are the pre-log factor in the 
capacity expression, and can be thought of as the gain 
over time-sharing that can be obtained by carefully perform¬ 
ing interference management and simultaneous routing of 
the information flows. As a result of DoF studies, several 
new interference management techniques have recently been 
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introduced, and shown to provide significant performance 
gains over simple time-sharing approaches [6], [11]—[13]. In 
particular, a careful combination of interference avoidance, 
interference neutralization, interference alignment [11], [12], 
and aligned interference neutralization [6] was used in [8] to 
fully characterize the DoF of two-unicast layered wireless 
networks. 

However, the promised gains in these DoF character¬ 
izations come at a high price. In order to mitigate the 
effective interference experienced by the receivers, heavy 
coordination is required among all nodes in the network. In 
particular, instantaneous channel state information (CSI) is 
assumed to be available at every transmitter. In small-scale 
or slow-fading networks, the task of providing transmitters 
with CSI can be carried out with negligible overhead using 
feedback channels. However, as wireless networks grow in 
size, nodes turn mobile, and fast-fading channels become 
ubiquitous, providing up-to-date channel state information 
at the transmitters (CSIT) is practically infeasible, and CSIT 
is usually obtained with delay. So how are the DoF gains 
promised by interference management techniques in multi¬ 
flow wireless networks affected by delayed CSIT? 

Perhaps the first study of the impact of the delayed CSIT 
in wireless networks was [14] where the delayed knowledge 
was used to create transmit signals that are simultaneously 
useful for multiple users in a broadcast channel. These ideas 
were then extended to different settings. Some examples are 
the study of erasure broadcast channels [15] and the capacity 
results for erasure interference channels [16]—[19]. In the 
context of multiple-input single-output (MISO) Gaussian 
broadcast channels (BC), it was shown that the delayed CSIT 
can still be very useful and in fact change the achievable 
DoF [20]. This discovery generated a momentum in studying 
the DoF region of multi-antenna two-user Gaussian IC and X 
channel [21]-[23], k- user Gaussian IC and X channel [24], 
[25], and multi-antenna two-user Gaussian IC with delayed 
CSIT and Shannon feedback [26], [27]. 

In this work, we study the impact of delayed CSIT in 
multi-hop multi-flow wireless networks by focusing our 
attention on two-unicast layered networks with arbitrary 
connectivity. It is known that, in the case of instantaneous 
CSIT, the sum DoF of these networks can only take the 
values 1, 3/2 and 2, and can be determined based on two 
graph-theoretic structures [28]. The first one is the notion of 
paths with manageable interference , which captures when 
the two information flows can coexist and achieve a total of 
2 sum DoF. The second one is the notion of an omniscient 
node , which creates an informational bottleneck and limits 



the DoF to 1. Whenever neither of these structures is found, 
3/2 DoF are achieavable. The case of delayed CSIT was 
previously considered in [29]. Interestingly, it was shown 
that as long as no omniscient node is found, at least 4/3 
DoF are achievable. Hence, just as in the instantaneous CSIT 
case, the omniscient node is the key informational bottleneck 
whose absence determines when we can go beyond 1 DoF, 
or simple time-sharing. However, it is also known that unlike 
in the instantaneous CSIT case, networks with delayed CSIT 
may have 4/3 DoF. Two questions naturally arise: How 
much richer is the set of possible DoF values in the delayed 
CSIT case? And what are the new informational bottleneck 
structures that apply only to the case of delayed CSIT? 

In this paper, we make progress on both of these questions. 
First, we generalize the concept of an omniscient node 
and introduce the notion of an m-bottleneck node. When 
a two-unicast network contains an m-bottleneck node for 
destination di, the DoF are constrained as mDi + D 2 < m, 
where Di is the DoF for source-destination pair i. Second, 
we show that, for m G N, there exists a two-unicast network 
with an m-bottleneck node where careful use of the delayed 
CSIT can achieve (1 — 1/m, 1) DoF, or 2 — 1/m sum 
DoF, matching the sum DoF outer bound implied by the 
m-bottleneck. This establishes that, unlike in several recent 
DoF characterizations where the sum DoF are shown to only 
attain a small and finite set of values [ 8 ], [30], [31], the set 
of DoF values for two-unicast networks with delayed CSIT 
is in fact infinite. 


II. Problem Setting 

A multi-unicast wireless (Gaussian) network J\f = (G, L) 
consists of a directed graph G = (V,£), where V is the 
node set and £ c V x V is the edge set, and a set of 
source-destination pairs L C V x V. We will focus on 
two-unicast Gaussian networks, which means that L = 
{(si r di), (s 2 . d 2 )}, for distinct vertices si,s 2 ,di,d 2 £ V. 
Moreover, we will assume that the network is layered , 
meaning that the vertex set V can be partitioned into r 
subsets Vi, V 2 , • • •, V r (called layers) in such a way that E c 
U'=1 Vi x Vi+u and Vi = {si,s 2 }, V r = {di,d 2 }. For a 
vertex v G Vj, we will let X(v) = {u G Vj-\ : (u, v) G E} 
be the set of parent nodes of v. 

A real-valued channel gain hij [t] is associated with each 
edge (i, j) G E at each time t. We consider a fast-fading sce¬ 
nario, where the channel gains {hij[t ]}^ 0 for i, j G V are 
assumed to be mutually independent i.i.d random processes 
each obeying an absolutely continuous distribution with finite 
variance. At time t = 1,2,..., each node i G V transmits 
a real-valued signal Xi [t] , which must satisfy an average 
power constraint ^ J2t-i ^ [^ 2 [ m ]] < P, G V, for a 
communication block of length n. The signal received by 
node j at time t is given by 

Y M= E h iM x A t \ + z j[t], (!) 

where Zj[t] is the zero-mean unit-variance Gaussian noise 
at node j, assumed to be i.i.d. across time and across nodes. 


We will use Xf to represent the vector (Xi [0],..., Xi[n — 1]) 
and if A is a subset of the nodes, X&\t\ = (2Q[f] : i G A). 

We consider a delayed CSIT model where instantanta- 
neous CSI is only available at the receiver of a given 
channel, and is learned with a unit delay at other nodes. 
More precisely, we assume that at time t, a node k G V has 
knowledge of 


{hi,k[t\ : i £ X(k)} U {hij[m\ : (i, j) G £, 1 < m < t - 1}. 


We will use FP = (hij[m\ : (i,j) G £ , 1 < m < t) to de¬ 
note the random vector corresponding to the channel state 
information up to time t. We point out that other more 
restrictive delayed CSIT models where nodes learn channel 
gains with a longer delay, or with a delay that is proportional 
to how far a given channel is in the network [29] can be 
considered. However, it is straightforward to see that, through 
an interleaving operation, such models can be reduced to the 
model here considered. 

We will use standard definitions for a coding scheme, 
an achievable rate pair (Ri r R 2 ), and the capacity re¬ 
gion C(P) of a network AT. We say that the DoF pair 
(£> 1 , 1 ) 2 ) is achievable if we can find achievable rate pairs 
(Ri(P), R 2 (P)) such that 


D; = lim 


Ri(P) 

P^OO f log P 


The sum DoF D ^ is defined as the supremum of D 1 + D 2 
over achievable DoF pairs. 


III. Main Results 

Several recent works on the DoF characterization of 
multi-flow networks revealed a similar phenomenon: for 
(Lebesgue) almost all values of channel gains, the DoF 
are restricted to a small finite set of values. In [ 8 ] for 
instance, it is shown that D ^ G {1,3/2, 2} for two-unicast 
layered networks. When the secure DoF of two-unicast are 
considered instead, [30] showed that we must have D s G 
{0, 2/3,1, 3/2, 2}. In [31], two-source two-destination net¬ 
works with arbitrary traffic demands were instead considered, 
and the set of DoF values was shown to be {1,4/3,3/2, 2}. 
Finally, for the delayed CSIT setting considered in this paper, 
[29] showed that, if D ^ 7 ^ 1 , then D £ >4/3, suggesting 
that perhaps in this case, D ^ is also restricted to a small 
number of discrete values. 

In this paper, we show that this is not the case. In fact, 
we prove the following: 

Theorem 1: There exist two-unicast layered networks 
with delayed CSIT and sum DoF taking any value in the 
set 

5^ { 2(1 —:fc = l,2,...Ju{2}. (2) 

Intuitively, the reason why the sum DoF of two-unicast 
wireless networks can take all values in S is the fact that the 
delayed CSIT setting creates new informational bottlenecks 
in the network. In this work, we identify a class of such 
structures, which we term m-bottleneck nodes. We defer 





Fig. 1. Example of a network containing a bottleneck node (^ 4 ). 

the formal definition of an ra-bottleneck node to Section [V| 
but we describe its significance with an example. Consider 
the network in Fig. [I] If instantaneous CSIT were available, 
V 2 and vs could amplify-and-forward their received signals 
with carefully chosen coefficients so that their signals cancel 
each other at receiver i? 4 . This would effectively create an 
interference-free network, and the cut-set bound of 2 DoF 
would be achievable. However, when only delayed CSIT is 
available, such an approach is no longer possible. In fact, as 
we show in Section [V| V 4 functions as a 2 -bottleneck node 
for destination d\, causing the DoF to be constrained as 

2Di + D 2 < 2 . 

As it turns out, by utilizing delayed CSIT, the DoF pair 
( 1 / 2 , 1 ) can in fact be achieved. 

In general, we show that whenever a network contains an 
m-bottleneck node for destination di under the delayed CSIT 
assumption, we have 

mDi + Dj < m, (3) 

where we let i = 3 — i, and i = 1,2. We point out that, in 
the case m = 1 , a bottleneck node reduces to the omniscient 
node [7], [28], [29], which was known to be an informational 
bottleneck in two-unicast networks, even under instantaneous 
CSIT. 

In addition, we show that it is possible to build a two- 
unicast layered network where the outer bound implied by ([3]) 
is tight. In order to do so, we introduce linear achievability 
schemes that make use of delayed CSIT in order to reduce the 
effective interference experienced by the bottleneck nodes 
as much as possible. Theorem [I] then follows by noticing 
that if we have a network with an m-bottleneck node for 
d\ and an m-bottleneck node for d 2 , then we must have 
mfii + D 2 < m and D 1 + mD 2 < m, which implies 

(m -f l)(Di D 2 ) ^ 2 m D\ - 1 - D 2 — 2 — 2 /(m - 1 - 1 ). 

Showing that two-unicast networks exist where the bound 
above is tight implies Theorem [l] Before proving our main 
results, we present two motivating examples to describe the 
role of an m-bottleneck node. 

IV. Motivating Examples 

In this section, we first investigate the DoF of two net¬ 
works, through which we motivate the idea of a bottleneck 
node and illustrate the transmission strategies that take 
advantage of delayed CSIT. 


^1 V5 



Fig. 2. Motivating example: we show that for this network, using the 
delayed CSIT, we can achieve (D±, D 2 ) = (2/3,1). 

A. Network with a bottleneck node 

Consider the network depicted in Fig. [2] If instantaneous 
CSIT was available, , ^3 and u 4 could scale their signals 
according to using the information of h V2 ^ V5 , h V3 ^ 5 , and 
h VA , Vb so that their interference at U 5 is canceled. However, 
when CSIT is only available with a delay, such an approach 
does not work, and in order for information to flow from 82 
to d 2 > some interference must inevitably occur at v§. This 
suggests that v 5 plays the role of a informational bottleneck, 
and the sum DoF should be strictly smaller than 2. 

In this subsection, we show that for this network we can 
achieve (Di,D 2 ) = (2/3,1). To do so, it suffices to show 
that during three time slots source s\ can communicate two 
symbols to destination di, while source s 2 can communicate 
three symbols to destination d 2 . Since we can concatenate 
many three-slot communication blocks, we can describe our 
encoding as if the three time slots for the first hop occur 
first, followed by the three time slots for the second hop, 
and finally, the time slots for the third hop. By concatenating 
many blocks, the delay from waiting three time slots at each 
layer becomes negligible. Next, we describe the transmission 
strategy for each hop separately. We will ignore noise terms 
to simplify the exposition. 

Transmission strategy for the first hop: During the first 
two time slots each source sends out two symbols; source 
si sends out symbols a\ and a 2 , while source s 2 sends out 
symbols 61 and & 2 . During the third time slot, source si 
remains silent while source s 2 sends out one symbol denoted 
by 63 . We note that upon completion of these three time slots, 
relay v\ has access to symbols a\ and a 2 , and relay Vj has 
access to symbols 61 , 6 2 , and 63 , j = 2,3,4. 

Transmission strategy for the second hop: The key part of 
the transmission strategy happens in the second hop. During 
the first time slot, relay t > 2 transmits 61 , relay V 3 transmits 
6 2 , and relay v 4 transmits 63 as depicted in Fig. [3] Ignoring 
the noise terms, relay U 5 obtains a linear combination of 
the symbols intended for destination d 2 , L\ ( 61 , 62 , 63 ) that 
for simplicity we denote by L x (6). Similarly, relays vq and 
V 7 obtain linear combinations T 2 (6i,6 2 ) and 1 / 3 ( 62 , 63 ) 
respectively. During the first time slot, v\ remains silent. 

At this point, using the delayed knowledge of the channel 
state information, relay vs can (approximately) reconstruct 
Li( 6 ). During the second time slot, relays V 2 and u 4 remain 
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Fig. 3. Transmission strategy for the second hop of the network depicted 
in Fig. [2] 

silent, relay v\ sends out ai, and relay vs sends out Li(6) 
(normalized to meet the power constraint). This way, v$ 
obtains a linear combination of a\ and Li(6) denoted by 
La(cli, Li(b)). Note that U5 already has access to Li(6) and 
thus can recover a\. Also, note that v§ and v? obtain Li(6). 

Finally, during the third time slot, relays v 2 , vs and V4 
remain silent, and relay v\ sends out a 2 . Upon completion 
of these three time slots, U5 has access to a\ and a 2 , vq has 
access to L\(b) and L 2 (61,62)* and v 7 has access to Li(6) 
and I/ 3 (62,63). 

Transmission strategy for the third hop and decoding: 

The transmission strategy for the third hop is rather straight¬ 
forward. Relay ^5 sends a\ and a 2 to di, and relays v$ and v 7 
send three linearly independent equations la (6), L 2 (61,62), 
and 1/3(62,63) to d 2 . Therefore (. Di,D 2 ) = ( 2 / 3 , 1 ) DoF 
are achievable for the network of Fig. [ 2 ] 


Vi Vq 



Fig. 4. In this example, we show we can achieve (D 1 , D 2 ) = (1,1). 

B. Network with no bottleneck node 

In this subsection, we consider the network in Fig. [ 4 ] As 
in the previous example, the lack of instantaneous CSIT 
prevents nodes ^3, V4 and U5 from scaling their signals 
according to the channel gains of the second hop so that their 
interference at vq and v 7 is canceled. Therefore, interference 
between the information flows is unavoidable. However, as 
we will show, since there is no single node acting as a 
bottleneck node (as in the previous example), ( 1 , 1 ) DoF 
can be achieved. As it turns out, the diversity provided by 
an additional relay allows for a retroactive cancelation of the 
interference. 

The transmission strategy has three time slots and the 
goal is for each source to communicate three symbols to its 


Fig. 5. Transmission strategy for the second hop of the network depicted 
in Fig. [4] 

corresponding destination. For the first hop, the transmission 
strategy is very similar to that of the previous example and 
during each time slot, each source just sends a new symbol 
{ai s for source s\ and 6$’s for source s 2 , for i = 1 , 2 , 3 ). 

Transmission strategy for the second hop: Similar to the 
previous example, the key part of the transmission strategy 
is in the second hop and that is what we focus on. The 
transmission strategy is illustrated in Fig. [ 5 ] and described 
below. 

During the first time slot, relays v\ and v 2 remain silent. 
Relay ^3 sends out 61, relay v 4 sends out 62, and relay v$ 
sends out 63. Ignoring the noise terms, relay vq obtains a 
linear combination of all symbols intended for destination 
d 2 that we denote by la (6). Similarly, relay vj obtains 
L 2 (62, 63) and relay v$ obtains L 3 (6). 

At this point, using the delayed knowledge of the channel 
state information, relay U3 can reconstruct la (6) and relay 
U4 can reconstruct L 2 (62,63). During the second time slot, 
V3 sends out la (6) and this equation becomes available to 
relay v$. During this time slot, relay v\ sends out a\ and 
relay v 2 sends out a 2 . Note that due to the connectivity of 
the network, relay vq receives L 4 (ai,a 2 ), Za(6)^, and 
relay v7 receives Lq (ai,a 2 ). Using the received signals dur¬ 
ing the first two time slots, relay vq can recover L 5 (ai, a 2 ). 
Relays V4 and v$ remain silent during the second time slot. 

In the third time slot, relay v\ sends out a3, and relay 
V4 sends out £2(62,63). All other relays remain silent. 
This way, relays vq and obtain L 7 (c&3, L 2 (62,63)) and 
Lg (<23 5 £2 (62, 63)) respectively. Now note that using the 
received signal during time slots one and two, relay v 7 can 
recover <23. 

Transmission strategy for the third hop and decoding: 

In the third hop, relays vq and v 7 can easily communicate 
^5(^15^2), L6 (<21,(22), and <23 to destination d\ during 
the three time slots. Note that these equations are (with 
probability one) linearly independent, thus destination d\ can 
recover its symbols. A similar story holds for destination d 2 . 
This completes the achievability of (Di,D 2 ) = ( 1 , 1 ) for 
the network of Fig. [ 4 ] 

V. Bottleneck Nodes 

As shown in the previous section, for the network of Fig. | 4 | 
it is possible to exploit the diversity provided by the relays 
to retroactively cancel out the interference caused by relays 












vs, v 4 , and v$ at relays vq and v?. However, it is not difficult 
to see that the same approach cannot work for the network 
in Fig. [2] This suggests that the network in Fig. [2] contains 
an informational bottleneck that is not present in the network 
in Fig. [4] and that restricts the sum DoF to be strictly less 
than 2. As it turns out, this informational bottleneck is relay 
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Fig. 6 . V 5 acts as a bottleneck for the information flow. Relays V 2 , V 3 , 
and V 4 have to remain silent during a fraction of the time steps in order to 
allow vi and U 5 communicate. 

v$. Notice that the information flow from si to d\ must go 
through U 5 . Moreover, the fact that the information flow from 
S 2 to d 2 must go through the set of nodes Ai = {V 2 , ^ 3 , V 4 }, 
and CSIT is obtained with delay, makes interference between 
the flows unavoidable and relays V 2 , V 3 , and V 4 have to 
remain silent during several time slots in order to allow s± 
and d\ to communicate. As we will show in this section, the 
size of the set Ai determines how restrictive the bottleneck 
node v$ is. For the example in Fig. [bj since \A4\ = 3, the 
bottleneck node implies a bound of the form 3Di + D 2 < 3. 

Before stating the main lemma on bottleneck nodes, we 
need a few definitions. 

Definition 1: A set of nodes A, possibly a singleton, is a 
(B, C )-cut if the removal of A from the network disconnects 
all paths from B to C. 

Definition 2: A node v is an omniscient node if it is an 
({si,s 2 }, di) -cut and there is a node u G I(v) U {v} that is 
a (s-, {di,d 2 })-cut. 

The existence of an omniscient node imposes that the 
sum DoF is bounded by 1, even when instantaneous CSIT 
is available. For more information regarding the omniscient 
node, we refer the readers to [28], [29]. Motivated by the 
definition of an omniscient node, we introduce the notion of 
an m-bottleneck node, which reduces to an omniscient node 
in the case m = 1 . 

Definition 3: A node v G V is called an m-bottleneck 
node for di if it is an ({si, s 2 }, c^)-cut and there is a set 
Ai C X{y) that is an (s-, {d\, d 2 })-cut such that \Ai\ = m. 

Although a 1-bottleneck node for di is an omniscient node, 
the converse is not true. The following theorem provides 
an outer-bound on the DoF of a two-unicast network with 
delayed CSIT and an m-bottleneck node for di. 

Theorem 2: Suppose a layered two-unicast wireless net¬ 
work AT contains an m-bottleneck node for di, for i G { 1 , 2 }. 
Then under the delayed CSIT assumption, we have 

mDi + D~i < m. (4) 


For m = 1 the theorem follows since a 1-bottleneck node 
for d\ is an omniscient node. In the remainder of this section, 
we prove this result in the case m > 1. Suppose for network 
Af , we have a coding scheme that achieves (Di,D 2 ) and v 
is an m-bottleneck node for d\ in layer V 0 + 1 . We use the 
network of Fig. [ 6 ] In this network, it is straightforward to 
verify that node v is a 3-bottleneck node for d\, according 
to Definition [3] 

The proof contains two main steps, stated in two separate 
lemmas. First, we construct a physically degraded MIMO 
BC, A/bc, where it is possible to achieve any DoF pair 
(Di, D 2 ) that is achievable in the original network Af. Since 
the capacity of a physically degraded BC does not change 
with feedback, we can drop the delayed CSIT. The second 
step is then to show that, if no CSIT is available, 0 must 
be satisfied in A/bc, which must therefore be satisfied in Af 
as well. We next describe these two steps in more detail. 

We first construct the MIMO BC A/bc based on Af as 
follows. The layer in Af preceding the bottleneck node, Vi, 
will become a sinlge source s' with \Vi\ antennas. A/*bc will 
contain two receivers, namely d[ and d' 2 . 



Fig. 7. Constructed MIMO BC A/rc tor the network A f in Fig. [6] 

Receiver d[ has only one antenna, which is a replica of 
the bottleneck node v in Af. On the other hand, receiver d' 2 
has m receive antennas labeled as vi, u 2 ,..., v m . See Fig. [ 7 ] 
for a depiction. The first receive antenna of d' 2 , v\, has the 
same connectivity, channel realizations and noise realizations 
as that of node v G Af. This guarantees that Y v% = Y v , and 
that A/bc is physically degraded. The remaining antennas of 
d 2 each have statistically the same observation as d[ G Af', 
but with independent channel and noise realizations. 

Lemma 1: Any DoF pair (Di, D 2 ) achievable in Af is also 
achievable in A/bc- 

Proof: First we focus on the network Af, and assume 
we have a sequence of coding schemes that achieve a given 
rate pair (ith, R 2 ). Since node v is a bottleneck node for d\, 
it is an ({si, s 2 }, di)-cut and must be able to decode W\ as 
well, and we have 

H (Wi\Y™ , H n ) < ne n , (5) 

where e n G 0 as n A 00 , from Fano’s inequality. Next we 
notice that from the received signals in any given layer one 
should be able to reconstruct VF 2 , and we have 

H (w 2 \ [F M y w X M + Z Ve+1 ] n ,X^ c ,H n ) 







< H (w 2 |y^ + 1 ,H n ) < ne n , ( 6 ) 

where we let Fj^y l+1 be the transfer matrix between M 
and Vg+i, and M c = Vg \ M. Our goal will be to emulate 
network A f in the MIMO BC A/bc, so that destination 
d[ can recreate Yf to decode W\, and destination d ' 2 can 
approximately recreate [F M y l+ 1 X M + Z Vi+1 ] n and X n Mc 
to decode W 2 . 

The main idea is to have the source s' in A/bc simulate 
all the layers in J\f up to Vg. In order to do that, let’s 
first suppose that s' and the destinations can share some 
randomness, drawn prior to the beginning of communication 
block. This shared randomness corresponds to noise and 
channel realizations for the network J\f during a block of 
length n. Let us denote these noise and channel realizations 
with the random vector U. Notice that the channel and noise 
realizations in U are independent of the actual channel and 
noise realizations in A/bc- Using U and the messages W\ 
and W 2 , s' can transmit what the nodes in layer Vg from M 
would have transmitted (same distribution). 

Since the received signal at d[ has the same distribution 
as the received signal at v in network J\f, similar to ([5]), for 
A/bc, we have 

H (Wi|yj/,HB C ,u) < ne n . (7) 

Similarly, since the first antenna of d ' 2 receives the exact 
same signal as d[, we have 

H (w^.HScU) 

< H (W 2 \W U YI ,Hg c ,U) + H (Vi \Y% ,Hg c ,u) 

< H (WilWuY^U^v) +ne n . (8) 

Next we notice that in J\f, Xm c is only a function of U 
and W\. As a result, the source s' in A/bc can reconstruct 
Xm c and transmit it from the corresponding antenna in A/bc- 
This is because Ad is a ( 52 , {di, d 2 })-cut in J\f and there can 
be no path from 52 to M c . Therefore, if we let Fm,v £+1 be 
the transfer matrix drawn as part of the shared randomness 
U, and Zy £+1 be a noise vector identically distributed as 
Zy i+1 in M but independent from everything else, we have 


h(w 2 \W u 

Y%, Hg c ,u) 


£ 

II 

Fm,v £+ 1 Xm + Z Vi+1 

Hg c ,u) 

+ l(w 2 ; 

Fm,v £+ iXm + Z Ve+ 1 

" \ Y d / 2 > Wi, Hg c , u) 

<h(w 2I 

Fm,V£+iXm + Z Vi+1 


+ l(w 2 ; 

Fm,v £+ 1 Xm + Zv e+ 1 

" IVi' > Wi, Hg c , u) , 


( 9 ) 

and, from the first term above is upper-bounded by 
ne n . All we need to show is that the mutual information 
term in ^ is o(logP). Let Gm,cI ' 2 and Gm c 4' 2 be the 
transfer matrices from M and AP to d ' 2 in A/bc- Notice 
that Gm,cI ' 2 is an m x m matrix and is invertible with 


probability 1. Therefore, from Wi, Hbc, and U we can 
build Gm c ,( 1 ' 2 Xm c i and then use it to compute 

F M,V e +i G M,d' 2 ( Yd 2 ~ G Mc,d' 2 X M<=) 

= Fm,Vi + 1 Xm + Z, 


where Z is a combination of noise terms, whose power is 
a function of channel gains, but not of P. Therefore, the 
mutual information term in ([9]) can be upper bounded as 


~ Z n ) 


( 


F M , Vt+ 1 X M + Z Ve+ 1 J H | VJl,W 1 ,W 2 , Hg C ,u) 

(X) 


h 

= h(Z$ e+i — Z n ) ~ h 

= h(Zy (+i — Z n ) — h 
< no(logP), 


( 10 ) 


where the first equality follows since Fm,v £+ 1 Xm is j ust 
a function of Wi, W 2 and U. Therefore, from ([ 8 ]), ([9]) and 
m , we have 

H(W 2 1 YJt, H bc , U) < ne n + n o( log P). 

Hence, under the assumption of shared randomness, any pair 
(Pi,P 2 ) achievable on M is also achievable in A/bc- But 
since the shared randomness is drawn independently from 
W\ and W 2 , we can simply fix a value U = u for which 
the resulting error probability is at most the error probability 
averaged over U. Thus, the assumption of shared randomness 
can be dropped, and the lemma follows. ■ 

Lemma [T] allows us to bound the DoF of network J\f by 
instead bounding the DoF of A/bc- 

Lemma 2: For the MIMO BC A/bc, mDi + D 2 < m. 

Proof: The MIMO BC A/bc is physically degraded 
since the first antenna of d 2 observes the same signal 
as d[. We know that for a physically degraded broadcast 
channel, feedback does not enlarge the capacity region [32]. 
Therefore, we can ignore the delayed knowledge of the 
channel state information at the transmitter (i.e. no CSIT 
assumption). We can further drop the correlation between 
the channel gains of the first receiver and the first antenna 
of the second receiver, as the capacity of a BC only depends 
on the marginal distributions of the received signals. Thus 
for the MIMO BC described above under no CSIT, we have 


n (mRi + P 2 - e n ) 

m 

-Yl 1 ( Wi ; Y |h b C 


3 = 1 


)} +/(w 2 ; y||Hg c ) 




3 = 1 


Hi 


Y" |H bc 


3 = 1 




h (y%\W U Hg c ) - h (y%\W u W 2 , Hg c ) 












^m +2 


m 

= J 2 { h IHSc) - h (n" 1^1. HSc) } 

3 = 1 

+ h (y% \Wu Hg c ) Z? m |Hg c ) 

m 

= E { /l (n" IHSc) - h (z- |Hg c ) } 

i=i 



i=2 


m 

< E ^ ( F «” I R bc) - & |Hg c ) } 

J = 1 

< mn log P + o (log P)) , 
where (i) follows since 

m 

h (Y^\W lt Hg c ) - £ h {Y* \W it Hg c ) 

3 = 1 
m 

= Y h (Y \K > , W U HSc) - h (yj* I W u HS C ) 

i=2 

m 

= y i yju; n” i^i, Hg c ). 

3 = 2 

Therefore, we conclude that 



^ra+l 


Fig. 8. In this example, we show that we can achieve corner point 
(D 1 ,D 2 ) = ((m - 1 )/m, 1). 

bj -1 intended for destination d 2 » j = 2,3,.. M m + 1. Ig¬ 
noring the noise terms, relay Vj obtains a linear combination 
of the symbols intended for destination d 2 , (bj, 

j = m + 2,..., 2m + 1. 



mD\ + Do < m, 


( 12 ) 


which completes the proof of Lemma [2] ■ 

Now once again, consider the network of Fig. [2] In this 
network, t ? 5 is a 3-bottleneck node for di. Thus for this 
network, using Theorem |2] we have 


0 < A < 1, i = l,2, 

3 Pi -f- P 2 ^ 3 . 


(13) 


In Section [IV| we provided the achievability proof of comer 
point (A, A) = (2/3,1). As a result, the outer-bound 
provided by Theorem [2] (alongside individual bounds) com¬ 
pletely characterizes the achievable DoF region in this case. 


VI. Proof of Theorem \T\ 

In this section, we describe the proof of Theorem [T In 
essence, we show that the example considered in Section 
|A| can be generalized to a class of networks that contain 
bottleneck nodes whose corresponding outer bounds can be 
achieved. 

First we consider the network illustrated in Fig. [ 8 ] In this 
network, u m+2 is an m-bottleneck node for di, and there 
is no bottleneck node for d 2 . We show that we can achieve 
corner point (A, A) = ((m — l)/m, 1). The achievability 
strategy is a generalization of the strategy presented for 
the network of Fig. [2j and uses m time steps. As in that 
case, the transmission scheme for the first and third hops is 
straightforward and we only focus on the second hop. 
Transmission strategy for the intermediate problem: The 
transmission strategy has m time slots. During the first time 
slot, relay v\ remains silent and relay Vj sends out symbol 


Fig. 9. Relay v\ is an m-bottleneck node for d\ and relay v 2 is an 
m-bottleneck node for d 2 . 

At this point, using the delayed knowledge of the channel 
state information, relay v 2 can (approximately) reconstruct 
Li(b). During the second time slot, relay v\ sends out a±, 
relay v% sends out A (6) (normalized to meet the power 
constraint), and relays V 3 ,..., v m +i remain silent. This way, 
v m +2 obtains a linear combination of a\ and L\(b) denoted 
by P m +i(ai, Li(b)). Note that u m+2 already has access 
to Li(b) and thus can recover a\. Also, Li(b) becomes 
available to Vj for j = 2 ,3,..., m + 1. 

During time slot i, £ = 3, ...,m, relay v\ sends out 
ci £-1 and relays u 2 , U 3 ,..., u m+ i remain silent. Note that 
with this strategy, u m+2 obtains ai, a 2 ,..., a m _i, and re¬ 
lays u 2 , U 3 ,..., Vm+i (with probability 1 ) obtain m linearly 
independent combinations of fri, 6 2 , ..., b m . Then the task 
for the third hop is to simply deliver ai,..., a m _ 1 to d\ and 
the m linearly independent combinations of b\ ,..., b m to d 2 . 

Since we have matching inner and outer bounds, we 
conclude that for the network in Fig. [8j the sum DoF are 
A = 1 + (m — \)/m = 2 — 1/m, for m G {1, 2,...}. Notice 
that this corresponds to half of the values in the set S in 
To obtain the remaining values in 5, we need a class of 
networks that contain both a bottleneck node for d\ and a 
bottleneck node for d 2 . 

Consider the network depicted in Fig. [9] For simplicity of 
notation, we have only labeled a few relays in this network. 
We claim that for this network A = 2 mn/{m + 1), m G 
{1,2,...}. First, we prove the converse. It is straightforward 
to verify that relay v\ is an m-bottleneck node for d\ 






and relay v 2 is an m-bottleneck node for d 2 . Thus, from 
Theorem [2] we have 

mDi + D~ { < m, i = 1, 2. (14) 


To prove that the outer-bounds are tight, it suffices 
to prove the achievability of corner point (Z>l,D 2 ) = 

(m/(m + l),m/(m+ 1)). 

Transmission strategy: The goal is to deliver m symbols to 
each destination during m + 1 time slots. Denote the symbols 
intended for d\ by a^s and the symbols intended for d 2 
by bi s, i = 1, 2,... 5 m. We point out that the network in 
Fig. [9] can be seen as a concatenation of the network in 
Fig. [ 8 ] with flipped copy of itself. Hence, we will describe 
the achievability in terms of each of the two subnetworks. 
We first describe how to deliver s to relay ^3 and s to 
relay V 4 . Then, the goal becomes for relay v% to deliver a^’s 
to d\ and for relay U 4 to deliver b^’s to d 2 , i = 1 , 2, ..., m. 
Since the two subnetworks are essentially identical, we only 
need to show that we can deliver s to relay v% and Vs 
to relay V 4 during m + 1 time slots. Then, the relays in the 
second subnetwork will implement a similar strategy to that 
of the nodes in the first subnetwork. 

Since the first subnetwork is identical to the network of 
Fig. [ 8 ] by using the same strategy, during m time slots we can 
deliver m — 1 symbols to v% and m symbols to V 4 . During 
the last time slot, i.e. time slot m + 1 , source s 2 remains 
silent, and source s\ sends out one more symbol, a m , to 
relay ^ 3 . This way, we successfully deliver Vs to relay ^3 
and Vs to relay V 4 during m + 1 time slots, i = 1 , 2,. .., m. 
Repeating the same strategy over the second subnetwork, 
each destination can decode its m symbols over m + 1 time 
steps, and we conclude that D ^ = 2m/(m+l) = 2—2/(m+ 
1). This completes the proof of Theorem [I] 


VII. Discussion 


In this paper, we introduced a new technique to derive 
outer bounds on the DoF of two-unicast wireless networks 
with delayed CSIT, and we presented several transmission 
strategies that can achieve these outer bounds. The presented 
transmission strategies achieve the optimal DoF in a finite 
number of time slots. In this section, we discuss two follow¬ 
up questions to our main results: 

(a) Do bounds of the form mDi + D • < m for m > 1 
suffice to characterize the DoF region of the two-unicast 
wireless networks with delayed CSIT? 

(b) Can we achieve the optimal DoF region of a two-unicast 
wireless networks with delayed CSIT in a finite and 
bounded number of time slots? 

As it turns out, the answers to the questions posed above 
are both negative. To provide some insights, we consider 
the network depicted in Fig. [TO] Under instantaneous CSIT 
assumption, the DoF region of this network is derived in [ 8 ] 
and is given by 


1 , 2 , 


vi 



Fig. 10. The outer-bound provided by Theorem [2] does not describe the 
DoF region of this network. Moreover, the achievability strategy for the 
corner points of the DoF region does not have a finite number of time slots. 


Interestingly, under the delayed CSIT assumption, we can 
still achieve this region. However, the network in Fig. [10] 
contains no bottleneck nodes. Moreover, it can be verified 
that the region in (13] ) cannot be obtained from bounds of 
the form mDi + Dj < m for m > 1 . 

Next, we briefly describe the achievability strategy for 
corner point (Di, D 2 ) = (1,1/2). The achievability strategy 
goes over 2 m + 1 time slots and upon completion of the 
transmission, we achieve 


(D 1 ,D 2 ) 


2 m k \ 
2 m + 1 ’ 2 m + 1 / 


( 16 ) 


where m is an arbitrarily chosen parameter. Thus, as the 
number of time slots m goes to infinity, we achieve arbitrarily 
close to the comer point (Di,D 2 ) = (1,0.5). 

The transmission strategy is illustrated in Fig. El We 
highlight the important aspects of this strategy here. First 
we note that by interleaving different blocks, we encode such 
that the first 2 m time slots of the first hop occur before the 
first time slot of the second hop. This way, there will be no 
issues regarding causality in the network. 

For the first hop, the communication during the first 2m 
time slots is straightforward. In the second hop during the 
first time slot, relays v\ and v 2 create random linear combi¬ 
nations of all the signals they received during the first 2 m 
time slots of the first hop and send them out. Destination one 
obtains L 2m+ 3 (a) + L 2 m + 4 :{b), and destination two obtains 
L 2 m+ 5 (a) + T 2m+6 (5). Our goal is to deliver L 2m+4 (5) to 
both receivers. Relay vs can reconstruct L 2 m+ 4 ( 6 ), however, 
there is no link form v% to destination one. As a result, during 
the final time slot, the second source sends out T 2 m+ 4 ( 5 ) and 
this signal becomes available to all receivers (see Fig. El 
where L 2 m+ 4 (5) is highlighted by a red oval). 

The key idea for the achievability would be the observation 
that relay v\ can combine its previous observations in a 
way that b^ s form L 2 rn+ ^(b), i = 1,2, ...,m. This way, 
during the first two time slots, the interference at destination 
one would be the same. Thus, if we provide L 2 rn +A(b) to 
destination one, it can recover L 2m+3 (a) and L 2m+7 (a). 
Finally, we note that L 2 m+ 6 ( 6 ) is linear combination of Vs 
that destination two obtains during the first time slot. 

Upon completion of the transmission strategy, destination 
one has access to 


0 < Di < 1 , 

D\ + D 2 < §. 


(15) 


Ui, U 2 , . . • , <2 2m _ 2 , L 2rn -)_3 (d) , Z/ 2m _|_7 (u). (17) 




t = m+2,...,2m+l 
G2m-2, • • • ,Om-1 



+m+4 (fj 
l j ‘2n< - 1 ^ /) 
L 2m+4 (&) 
t = 3 

^2m+4 (^) 
^2m+4 (&) 


15-c 4,...,m+l 


ai,..., a m _2 


& m _ 2 


'i = m+2,...,2m+l 


-!>•••> O j 2m~2 


Fig. 11. Achievability strategy for corner point (D±, D 2 ) = (1, 1 / 2 ) of the DoF region of the network of Fig. [To] The achievability strategy uses 2m + 1 
time slots and as m -+ 00 , we achieve the desired corner point. 


Hence, receiver one has enough equations to recover its 
intended symbols. Similarly, destination two has access to 

^lj ^ 2 ? • • • 5 ^ m — 2 : -^2771+4(^)5 -^2771+6(^)5 ( 18 ) 

which allows destination two to recover its intended sym¬ 
bols. 
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